5 Concluding Remarks and Challenges

Authors

  • G H John
  • R Kohavi
  • K Pfleger
  • Morgan Kaufmann
  • A J Katz
  • M T Gately
  • D R Collins
Abstract

In this paper we have discussed an approach for discovering useful knowledge from the large amounts of data that are generated, manually and automatically, during the maintenance and operation of commercial aircraft. We have discussed several important issues in analyzing data in this domain: data format, data complexity, domain information, and the presence of contexts. We introduced a knowledge discovery approach that we have developed for this real-world application. The approach consists of four steps: (i) identification of the relevant sources of information, (ii) exploration of the selected relevant data, (iii) sampling and data transformation, and (iv) modelling. All of these steps are guided by a specific investigation problem, which has to be formulated with the help of domain experts before the analysis starts. The proposed approach helps to guide the analysis through the application of diverse discovery techniques. Such a methodological procedure helps us to address the complexity of the domain considered and therefore optimize our chances of discovering valuable information. We presented preliminary results that plausibly support this hypothesis, but more experiments are clearly required.

During the presentation of the approach, we raised several difficulties that have to be addressed to successfully apply machine learning algorithms in complex real-world domains. Among others, we noted the problems related to the labelling of the instances, the selection of the relevant data, and the use of contextual information. In a long-term project such as the one described here, it may also be very important to address the following three issues: (i) finding an automatic approach to define relevant investigation problems for this domain, (ii) developing the tools necessary to disseminate discovered knowledge to the end users, and (iii) automating most of the tasks involved in the data mining process.
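The four steps listed in the abstract can be pictured as a small pipeline. The sketch below is only an illustration under stated assumptions, not the authors' implementation: the `Record` fields, the source names, the toy data, and the single-feature threshold model are all hypothetical stand-ins chosen to keep the example self-contained.

```python
import random
from dataclasses import dataclass


@dataclass
class Record:
    # All fields are hypothetical; the paper does not specify a schema.
    source: str       # e.g. "sensor" or "cabin_log"
    component: str
    value: float      # a single numeric measurement
    label: int        # 1 = problem reported (assumed to come from domain experts)


def identify_sources(records, relevant_sources):
    """Step (i): keep only records from sources relevant to the investigation problem."""
    return [r for r in records if r.source in relevant_sources]


def explore(records):
    """Step (ii): summarise the selected data as (count, mean value) per component."""
    totals = {}
    for r in records:
        count, total = totals.get(r.component, (0, 0.0))
        totals[r.component] = (count + 1, total + r.value)
    return {c: (n, t / n) for c, (n, t) in totals.items()}


def sample_and_transform(records, n, seed=0):
    """Step (iii): draw a random sample and map each record to (feature, label)."""
    rng = random.Random(seed)
    chosen = rng.sample(records, min(n, len(records)))
    return [(r.value, r.label) for r in chosen]


def fit_threshold(examples):
    """Step (iv): toy model. Pick the threshold t that minimises training
    errors when predicting label 1 for value >= t."""
    candidates = sorted({v for v, _ in examples})

    def errors(t):
        return sum((v >= t) != bool(y) for v, y in examples)

    return min(candidates, key=errors)


# Toy data, invented for illustration only.
records = [
    Record("sensor", "engine", 0.2, 0),
    Record("sensor", "engine", 0.9, 1),
    Record("sensor", "engine", 0.4, 0),
    Record("sensor", "engine", 0.8, 1),
    Record("cabin_log", "seat", 0.5, 0),
]

selected = identify_sources(records, {"sensor"})   # step (i)
summary = explore(selected)                        # step (ii)
examples = sample_and_transform(selected, n=4)     # step (iii)
threshold = fit_threshold(examples)                # step (iv)
```

In this sketch the investigation problem enters only through the choice of `relevant_sources` and through the labels, which mirrors the abstract's point that every step is guided by a problem formulated with domain experts beforehand.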
Acknowledgments

Thanks to Chris Sowerby and Francis Ruest from Air Canada for providing domain information and helping us during the evaluation of the results. Thanks to the three anonymous referees of the Workshop for their comments on an earlier version of this paper.

… labelling, and ii) the labelling procedure may be very difficult to automate since it typically involves a considerable amount of expertise in the target domain. On the other hand, the complexity of the labelling process may depend on the sampling strategy. More research on sampling strategies for large data sets [Musick, Catlett, and Russel 1993] is therefore also important. As shown in …

Similar resources

Concluding Remarks of the Second International Congress on Traditional Medicine & Materia Medica, 4-7 Oct. 2004, Tehran, Iran

Plants are one of the great resources of the world, and as humankind has evolved, socially, spiritually, and economically, we have found, collectively, a myriad uses for the plants around us. One of those uses is for the prevention, treatment, and cure of various disease states. Documented knowledge about such use dates back at least 4000 years, and several of the plant mentioned in the ancie...

Specifying Railway Interlocking Systems (This research is funded by Westinghouse Rail Systems, Chippenham, UK.)

One of the Grand Challenges in Computer Science is to verify railway interlocking systems [1]. We give a generic datatype of control tables and ladder logic (2,3), and extract from these verification conditions (4). A proof of the correctness of these conditions is performed using induction and a datatype of reachable states (5). Finally, some concluding remarks are presented (6). This specific...

arXiv:nucl-th/9302003v1 5 Feb 1993 ISN 93-17 SPIN AND FLAVOUR: CONCLUDING REMARKS

We review some of the salient results presented at this Workshop, together with some comments on the underlying physics, and the proposed facilities for future experiments. ISN 93–17 February 5, 2008. Invited talk at the Workshop on Spin and Flavour in Hadronic and Electromagnetic Interactions, Turin, September 1992, to appear in the Proceedings.

Decentralisation and Community-based Natural Resource Management in Tanzania. – The Case of Local Governance and Community-based Conservation in Districts around the Selous Game Reserve

CONTENTS 5 Concluding Remarks

2 Requirements for Ground Simulation; 2.1 Similarity; 2.1.1 General Considerations; 2.1.2 Blunt Body Flows; 2.2 Power; 2.3 Instrumentation...

Publication date: 1997